Picture for Pheng-Ann Heng

Pheng-Ann Heng

T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT

Add code
May 01, 2025
Viaarxiv icon

Landmark-Free Preoperative-to-Intraoperative Registration in Laparoscopic Liver Resection

Add code
Apr 21, 2025
Viaarxiv icon

Synergistic Bleeding Region and Point Detection in Surgical Videos

Add code
Mar 28, 2025
Viaarxiv icon

Rethinking End-to-End 2D to 3D Scene Segmentation in Gaussian Splatting

Add code
Mar 18, 2025
Viaarxiv icon

SCJD: Sparse Correlation and Joint Distillation for Efficient 3D Human Pose Estimation

Add code
Mar 18, 2025
Viaarxiv icon

UniHOPE: A Unified Approach for Hand-Only and Hand-Object Pose Estimation

Add code
Mar 17, 2025
Viaarxiv icon

HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model

Add code
Mar 13, 2025
Viaarxiv icon

SciVerse: Unveiling the Knowledge Comprehension and Visual Reasoning of LMMs on Multi-modal Scientific Problems

Add code
Mar 13, 2025
Viaarxiv icon

MedHallTune: An Instruction-Tuning Benchmark for Mitigating Medical Hallucination in Vision-Language Models

Add code
Feb 28, 2025
Viaarxiv icon

Can We Generate Images with CoT? Let's Verify and Reinforce Image Generation Step by Step

Add code
Jan 23, 2025
Figure 1 for Can We Generate Images with CoT? Let's Verify and Reinforce Image Generation Step by Step
Figure 2 for Can We Generate Images with CoT? Let's Verify and Reinforce Image Generation Step by Step
Figure 3 for Can We Generate Images with CoT? Let's Verify and Reinforce Image Generation Step by Step
Figure 4 for Can We Generate Images with CoT? Let's Verify and Reinforce Image Generation Step by Step
Viaarxiv icon